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1 Introduction 

Target Tracking is the problem of keeping track of a set of specified targets by means of a given 
\q ■ set of sensors. We study the target tracking problem in which the targets lie in the plane and the 

sensors are cameras also positioned in the plane. It requires two distinct cameras to estimate the 
position of a target. The quality of this estimation depends mainly on the relative position of the 
target with respect to that of the two cameras assigned to it [3, 2]. The field of view of a camera is 
a cone. A target can be tracked by a camera if it lies in this cone, and therefore a target tracked 
q I by a pair of cameras should lie in the intersection of their respective cones. 

Tracking a target in this manner cannot in general provide accurate estimates of position. Hence 
it is important to carefully pick and assign pairs of cameras to different targets so as to minimize the 
error in estimation. With this setting our problem can be viewed as a resource allocation problem. 
qq | For arbitrary error functions, this problem is NP-hard and hard to approximate - belonging to the 

class of Multi-Index Assignment Problems [6], but usually the error is some function of the geometry 
of the camera and target positions [4]. Other NP-hard versions of multi-index assignment problems 
also focus on geometry, such as those aiming to minimize the circumference or the area of a triangle 
formed by three assigned points in the plane [7]. 

The problem we thus consider is the Focus of Attention problem (FoA) which requires us to find 
a pairing of cameras and an assignment of camera pairs to a target in a manner that is optimum 
for some measure of tracking quality [4, 2]. In our work we shall consider the constrained geometric 
setting in which the cameras are stationed on a line in the plane. Past work on this problem has 
also focused on this constrained setting [4, 2] and it is likely to model well the scenario in which 
targets are at relatively large distances from viable camera positions. Although the cameras cannot 
move they can rotate and freely choose their viewing direction. Both the cameras and the targets 
are represented by points. 

We consider two geometric error metrics. The first is the 'Aspect Ratio" which is the ratio of 
the vertical distance of the target from the camera line to the distance between the two cameras 
dedicated to it. (Refer Figure 1(a).) This metric can be used to gauge the error in stereo recon- 
struction and gives a good approximation if the cameras are not too close to the target. It was first 
considered by [4]. 

The second metric is the "Tracking Angle Deviation From Right Angles", studied by Gfeller 
et al. [2], who state that for a pair of cameras tracking a target, the tracking accuracy is best if 
the angle at the target - the tracking angle - is closest to 90°. Thus the considered metric is the 
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(a) Aspect Ratio Objective Function (b) Tracking Angle Objective Func- 

tion 

Figure 1: Here cameras c% and Cj that are bij apart are assigned to target tk which is at a distance 
of Zk from the camera line 




Figure 2: Tracking angles are all less than 90° if targets are not too close 



deviation of the angle from this desired value. (Refer Figure 1(b).) If the targets are not too close 
to the cameras, we can assume a scenario in which the tracking angles are all small i.e. less than 
90°. In such a scenario minimizing the deviation from the optimum tracking angle is equivalent to 
maximizing the tracking angle. In this paper, we assume that all tracking angles are less than 90° 
and hence our objective is to maximize the tracking angles. 

We can assume without loss of generality that all the targets lie on one side of the line on which 
the cameras are placed. If they are not, we can consider their projections without affecting either 
of our optimization metrics. 

Formally, we define the two versions of the FoA problem we consider as follows: 
INPUT: A set T of n targets given as points in the plane and a set C of 2n cameras, given as 
collinear points on line I in the same plane. 

FEASIBLE SOLUTION: A camera assignment where each target is assigned to two cameras and 
each camera is assigned to exactly one target. 

MEASURE: (1) An Aspect Ratio and (2) A Tracking Angle for every triple consisting of a target 
and two cameras. 

GOAL: Find a feasible solution which is optimal for the sum of (1) aspect ratios and (2) tracking 
angles. 

In this paper we consider the problem of (1) minimizing the sum of aspect ratios - the MIN- 
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SUMOFRATIOS problem and (2) maximizing the sum of tracking angles - the MAXSUMOFAN- 
GLES problem. For this second problem, we assume that the input has the property that the 
tracking angle of every triple is at most 90°, that is, every target lies outside the Thales' circle of 
any possible camera pair (Figure 2.) 

Related Work. Target Tracking is an important research topic in the field of computer vision 
and image processing and has applications in environment surveillance and monitoring applications 
[8, 2]. 

Isler et al. [4] first studied target tracking by formulating the Focus of Attention (FoA) problem 
as a combinatorial optimization problem. The motivation behind their work was lowering the 
costs of optimum depth estimation. They showed that in a general setting (not in the plane) 
this comprises the classical NP-Hard 3- Dimensional Matching (3DM) problem as a special case. 
Therefore the focus in [4] is on the constrained geometric setting in which all cameras are restricted 
to lie on a single line I. The objective is the "aspect ratio" where Zv is the distance of target 
tk from I and bij is the distance between cameras Cj and Cj assigned to target tk (also called the 
baseline). They give a 2-approximation for the problem of minimizing the sum of aspect ratios 
(that is, the MINSUMOFRATIOS problem) and for the problem of minimizing the maximum 
aspect ratio. Also, if the cameras are placed equidistantly on the line, they present a PTAS for the 
MINSUMOFRATIOS problem. They also consider cameras on a circle and targets inside the circle 
with tracking cost being gjj^a, where 6 is the tracking angle, and deliver a 1.42- approximation for 
the problem of minimizing the the sum of tracking costs, and the maximum tracking cost. 

Gfeller et al. [2] show that the problem of minimizing the sum of the deviations of tracking 
angles from 90° (best tracking angle for accuracy) is NP-Hard, and that it admits no (multiplicative) 
approximation. For cameras on a line, they present a 2-approximation algorithm for the problem 
of maximizing the sum of tracking angles (that is, MAXSUMOFANGLES) and maximizing the 
minimum tracking angle (or the bottleneck angle) under the assumption that all tracking angles 
are less than 90°. Also, if the cameras are placed equidistantly on the line, they present a PTAS 
for the MAXSUMOFANGLES problem. Arkin and Hassin [1] give a 2 + j approximation for the 
problem of maximizing the sum of tracking angles with cameras lying on a line (Here, t is the size 
of the local neighborhood in the local-search algorithm). 

Our Contribution and Techniques. We consider the FoA problem that asks for camera as- 
signment with minimum sum of Aspect Ratios (MINSUMOFRATIOS) and FoA that asks for 
camera assignment with maximum sum of tracking angles (MAXSUMOFANGLES). For cameras 
on a line, (and targets not lying in the Thales' circle of any camera pair) we present a Quasi- 
PTAS for MAXSUMOFANGLES. For cameras on a line, we present a Quasi-PTAS also for the 
MINSUMOFRATIOS problem. A Quasi-PTAS is an algorithm that, for any constant < e < 1, 
returns a solution whose cost is within an additive e factor of the optimal and has time complex- 
ity n polylog ( n ). Thus we improve upon the constant factor approximations known for these two 
problems, using quasi-polynomial rather than polynomial time. 

It is evident that the powerful geometric structure underlying the Focus of Attention Problem 
sets it apart from the more general assignment problems and makes it interesting. This geometry 
has been exploited in various ways by [4, 2] to obtain efficient approximation algorithms. In 
particular in both these efforts the cameras are divided into two sets of equal size and classified as 
"left" cameras and "right" cameras. Also in the special case of cameras placed equidistantly on the 
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line both [4, 2] use the technique of further partitioning both the left and right cameras and then 
guessing the number of camera pairs in the optimal solution for each pair of blocks of the partition. 

We also make use of the concept of left and right cameras; we extend the partitioning technique 
to cameras spaced arbitrarily on the line by making use of an intelligent discretization process. The 
main idea is to partition the interval containing the left (and right) cameras into a small number 
of intervals called "buckets" , and guess the number of camera pairs between every pair of buckets 
in the optimal solution. 

In the MAXSUMOFANGLES problem, if we guess that there are T camera pairs involving 
buckets B and B', we cannot simply return T arbitrary pairs involving cameras in B and B'. Some 
of these pairs can be too "sensitive" for this crude process. Luckily, we get around this difficulty 
via a geometric observation about angles which implies that a sensitive pair can be sensitive at its 
-B-end or its B'-end but not both. 

For the MINSUMOFRATIOS, the issue of sensitive pairs does not arise, but the main difficulty 
is that there is no single "scale" of distances at which we can apply the discretization. (This 
difficulty would not arise if the ratio of the maximum inter-camera distance to the minimum inter- 
camera distance is polynomially bounded.) So we first apply the discretization at the scale of 
the median inter-camera distance (in the optimal solution), and recursively obtain and solve two 
independent instances of the problem with size at most n/2. A characterization due to Isler et 
al.[4] of the optimal solution for a fixed camera pairing with the targets turns out to be quite useful 
in making this approach work. 

Organization of the paper. In Section 2, we introduce some notation, review some important 
observations made in previous work on these problems, and give a high level overview of our 
algorithms. We present our algorithms for MAXSUMOFANGLES and MINSUMOFRATIOS in 
Sections 3 and 4, respectively. 

2 Preliminaries and Notation 

Suppose we have some horizontal line I and a set C of 2n cameras such that each camera lies on I. 
We call the iih camera on the line Cj. For q, Cj S C, we say q < c,- if Cj is to the left of Cj on I. We 
assume that no two cameras have the same position, and thus we have c\ < C2 < . . . < cm- We 
are also given a set T = {t±,t2, ■ ■ ■ , t n } of targets that lie in the plane. Without loss of generality, 
we can assume that all of the targets lie above I. The distance between cameras Cj and Cj is called 
the baseline of Cj and Cj and is denoted bi j. A camera pair is a set of two distinct cameras, and a 
camera pairing of a set C of 2m cameras is a set of m camera pairs such that each camera in C' 
appears in exactly one of the camera pairs. 

A pairing of cameras is all- overlapping if the baselines of any two camera pairs in the pairing 
intersect. Suppose there is a camera pairing that contains pairs ((H,Cj) and (c.j/,Cj/) such that 
Q < Cj < Ci> < Cj' (i.e. the pairing is not all-overlapping). It is easily seen [2] that we can do better 
for both objective functions if we use the pairs (cj,Cj/) and (cj,Cj>). Refer Figure 3. This leads 
to the observation [2] that there is an optimal camera pairing where every camera pair (q, Cj) has 
i € {1, 2, . . . , n} and j G {n + 1, n + 2, . . . , 2n}. Let M denote the midpoint of b n , n+ \. See Figure 
4. Another way to view this observation is that there is an optimal solution with camera pairing P 
such that for all pairs (cj, Cj) E P, c, is to the left of M and Cj is to the right of M. Our algorithms 
will only consider such camera pairings. 
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Figure 4: In this example n = 3, cameras c±, 02,03 are "left" cameras and the rest are "right 
cameras. M is the mid-point of baseline 63,4 formed by cameras 03 and C4. 
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We shall now present an overview of our algorithms. Although historically the "Aspect Ratio" 
objective function was considered first [4], it was shown in [2] that the "Tracking Angle" is in 
general the most influential way of tracking quality directly. Hence we shall consider this metric 
first. Both algorithms use a discretization procedure to aid in computing our solution. However, 
a crude discretization procedure will not suffice for either problem, and we must use a more clever 
technique. In the case of MAXSUMOFANGLES, a geometric observation comes to our rescue. In 
the case of MINSUMOFRATIOS, a more sophisticated discretization process comes to our rescue. 

First consider the MAXSUMOFANGLES problem. Our algorithm constructs a small number 
of camera "pairings" - (a pairing is simply the set formed when each camera is paired with exactly 
one other camera). Given this set of camera pairings, we then are able to determine which camera 
pairing is the best by using a polynomial time algorithm for minimum-weight perfect matching on 
a bipartite graph in which one set of vertices is the camera pairs [5] and the other is the targets , 
as observed by Gfeller et al. [2]. 

Suppose we fix some optimal solution that uses some all-overlapping camera pairing P. That 
is, every camera pair in P involves one camera to the left of M and one to the right of M. We 
discretize the line into a small number of buckets such that the length of each bucket is small 
compared to its distance from M. Consider a bucket on one side of M and another bucket on the 
other side of M. Suppose we are able to correctly guess that the number of camera pairs in P 
that have one camera in each of these buckets is /i. We would like to be able to arbitrarily pick 
\i cameras from the first bucket and pair them with an arbitrary ^ cameras in the second bucket 
and argue that the tracking angle formed by these camera pairings at the target will not be too 
small as compared to the one formed by the corresponding pair in P. Unfortunately, this approach 
will not work, as the arbitrary assignment might cause some tracking angle to be more than an e 
factor smaller than the corresponding angle in P. To get around this obstacle, we state and prove 
a geometric lemma (Lemma 2) which allows us to handle these "sensitive" cases. The algorithm is 
then to guess the number of such "sensitive" cases that arise for each of our buckets. We handle 
those camera pairs (relying heavily on our lemma), and then we use the more crude technique on 
all of the remaining cameras falling in buckets. If any cameras still remain to be paired, we show 
that there is an easy way to pair them off (again making use of the same lemma). 

Now consider the MINSUMOFRATIOS problem. Suppose we fix some optimal solution that 
uses some all-overlapping camera pairing P. Let b % denote the ith baseline in P when the baselines 
are indexed such that b 1 < b 2 < ■ ■ ■ < b n . The high level idea of our algorithm is that we will guess 
the length of the median baseline (i.e. b n ^ 2 ) and then guess all camera pairs in P whose baselines are 
within a polynomial factor of b n l 2 . To aid us in guessing, we use a discretization procedure. This 
procedure allows us to make all possible guesses in quasi-polynomial time while not doing too much 
worse than what the optimal solution would have done. Suppose that we make a correct guess. 
Then we will have correctly guessed all camera pairs in P whose baselines fall within some interval 
[i^fr 3 ] for b % < b n l 2 < W up to an e factor. We then recursively repeat this guessing procedure for 
baselines in the interval [fr 1 ,^ -1 ] and the interval [V + , b n ]. The correctness of the algorithm then 
follows by showing that the errors do not accumulate too much over the course of the recursion. 

Fix a camera pairing P and a set of targets T such that |P| = \T\. The optimal association of 
camera pairs in P with targets in T is rather easy to compute, using a characterization given by 
Isler et al. [4]: 

Lemma 1. Let Zi be the distances of targets in T from line I, Z\ < Z<i < • • • < Z m and b l be the 
baselines in P sorted such that b 1 < b 2 < • • • < b m . There exists an optimal matching such that the 
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target at depth Z% is assigned to the pair with baseline b l . 



We denote the cost of such an assignment of a pairing P to targets T as cost (P,T) and we 
compute it according to the assignment in Lemma 1. 

3 Maximizing the Sum of Angles 

We now describe our approximation algorithm for the MAXSUMOFANGLES problem. We adopt 
the notation of Section 2 - the camera locations are c\ < C2 < ■ ■ ■ < C2n on line I , with c n < 
M < c n +i. The targets are points in the plane above the line, and denoted t\, . . . , t n . Let e > be 
the given approximation parameter; we may assume that e < 1/2. Our algorithm outputs a small 
number of camera pairings of C. One of the camera pairings that we output will have the property 
that it is possible to associate each of the camera pairs in it with targets in such a way that the 
sum of the angles at the targets is at least (1 — e) times that of OPT, where OPT is the value of 
the sum of tracking angles in an optimal solution. For a given camera pairing, we can compute the 
best association with the targets using a polynomial time algorithm for minimum-weight perfect 
matching [5] , as observed by Gfeller et al. [2] . We can therefore evaluate all of the camera pairings 
that we output, and return the best one. 

Our algorithm makes use of a subroutine for partitioning an interval to achieve a result of the 
following form: Given < 71 < 72, partition the interval [M + 71 , M + 72] (resp. [M — 72, M — 71] 
) into buckets (sub- intervals), so that the distance between the left (resp. right) endpoint of each 
bucket from M is at least \ times the bucket length. We will call a partition of [M + 71, M + 72] 
(resp. [M — 72,-^ — 71] ) with this property a conforming partition. Clearly, we can compute a 



buckets, then partition [M + 271, M + 471] into 1/e 2 equal sized buckets, and so on until we are 
past M + 72. We also adopt the following notation for the algorithm: L will denote the cameras 
to the left of M, and R the cameras to the right. During the course of the algorithm, some of the 
cameras get paired, and L (resp. R) will always stand for the unpaired cameras. Given a bucket B 
to the left (resp. right) of M, we will let \B\ denote the number of cameras in the current L (resp. 
R) that fall in bucket B. Finally, let a = M — c±, and d = C2 n — M. In describing the algorithm 
and analysis, we will assume a < d; the other case is symmetric. See Figure 5 for an illustration. 

The Algorithm. The following algorithm outputs one candidate pairing of the cameras C for 
each combination of the functions a, tt, /i, and A considered. We then evaluate each of these pairings 
based on the stated criterion (MAXSUMOFANGLES) and return the best one. 

Running Time. We will now bound the running time of the algorithm by bounding the number 
of possibilities that the mappings a, tt, and [i consider. We can bound the number of cameras in 
any bucket by n. Thus we can bound the number of possibilities for a by n°( k \ and we can bound 
the number of possibilities for ir by n°^\ We can bound the number of possibilities for fj, by n°^\ 
Because the number of buckets j and k are bounded by O ^ lo g(^A) ^ ; we have that the running 
time of the algorithm is bounded by 




buckets: partition [M + 7i,M + 271] into 1/e 2 



equal sized 




This expression also bounds the number of candidate pairings of C that are output 
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(M-4 71 ) (M-2 7 i) (M-71) (Af + 71) (M + 271) (M + 4 71 ) (M + 871) 



buckets of equal length 



buckets of equal length 



Figure 5: Illustration for a conforming partition (the intervals on I are not drawn to scale). Each 
of the shown subintervals of I (except for [M — 71, M] and [M, M + 71]) are divided into buckets 
of equal length. 




Figure 6: Illustration for Lemma 2. For a-i > ai, > 1^4, 



Approximation Factor. Let oct denote the target angle corresponding to target t £ T in the 
optimal solution OPT. We now show that there is one choice of the functions a, ir, fi, and A for 
which it is possible to associate the candidate pairing output by our algorithm to the targets in 
such a way that the target angle (3t in this solution for any target t satisfies 



Pt>a t - 4max{ea t , A^y^a t }. 



(1) 



Adding this inequality for each t £ T gives ^2 t Pt > (1 — 4e)^ t «t, establishing the algorithm's 
correctness. In order to establish this inequality, we will find it convenient to move the cameras 
as part of the analysis; this will be safe because we only move towards M. For each target t, let 
l(t) and r(t) initially denote the cameras associated with t in OPT to the left and right of M, 
respectively. As we modify the association of targets with camera pairs in the analysis, l{t) and 
r{t) will change. At all times, fit will denote the angle l(t)tr(t). The following geometric lemma, 
whose proof we present in the appendix, plays a crucial role in the analysis. Refer Figure 6. 
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Algorithm 1 

1: Let i?o be the bucket [M — j^?, M]. Let B\, . . . , B^ be the buckets resulting from a conforming 

partition oi[M-a,M- j^}. 
2: for each map a : {B\, . . . , B^} — > Z + such that a(Bi) < \Bi\ do 

3: For each 1 < i < k, pair the cr(Bi) leftmost cameras in Bi with the cr(Bi) leftmost cameras 
in R arbitrarily. 

4: Let B' be the bucket [M, M + ] ■ Let B[, . . . , Bj be the buckets resulting from a con- 
forming partition of [M + Af + f ]. 
5: for each map tt : {B[, . . . , Bfi -> such that cr(^) < |J3<| do 

6: For each 1 < i < j, pair the ^(B^) rightmost cameras in B^ with the ^(B^) rightmost 

cameras in L arbitrarily. 
7: for each map /U : {Bq,...,B}.} X {B' ,...,B'j} — > -E + such that Y^l^i^ii^'l) — f° r 

every i < /c and n(Bi, B^) < {B'^ for every i < j do 
8: Go through the (Bi,Bi) pairs in any order and arbitrarily pair ^{Bi^B'g) cameras from 

Ln Bi with fi(Bi, B'f) cameras from Rf] B' t 
9: for each < A < \B\ where B is the bucket [M - a, M] do 

10: Pair the A leftmost cameras in L n B = L with the A leftmost cameras in R arbitrarily. 

11: Pair the remaining cameras in L arbitrarily with cameras in R. 



Lemma 2. Let x,y,t be three non-collinear points in the plane such that angle(xty) = 9. Let z 
and z' be two points on the line xy such that both z and z' lie between x and y and angle{xtz) = 
angle(ytz') = e-0. Recall that < e < 1/2. Then the ratio < where \xz\, \xz'\ are the lengths 
of segments xz, xz' respectively. 

Fixing a. In this step, let us call a target t sensitive if l(t) lies in bucket Bi for i > 1 (so not in the 
first bucket Bq), and the angle l(t)tp is at least e times the angle (3t = l(t)tr(t), where p is the right 
endpoint of bucket Bi. Since the buckets come from a conforming partition, Lemma 2 implies that 
for a sensitive t the angle l(t)tc for any camera c to the right of M is at least (1 — e)f3f (The upshot 
is that we have to be careful in changing l(t), but we have considerable flexibility with r(t).) Fix a 
so that cr(Bi) is the number of sensitive targets t with left endpoints within B^. With this choice of 
a, recall that our algorithm fixes Y2« camera pairs at this stage. Now for each sensitive t, reset 

l(t) so that it lies to the left of (or is the same as) the original l(t) and is one of the cr(Bi) leftmost 
points in its bucket Bi, and reset r(t) to be the camera that is paired with the new l(t) in the 
partial camera pairing that is fixed. It follows that now fit > (1 — e)ctt > at — max{e«t, -% Y2t a t} 
for all sensitive t. The association of sensitive t with the camera pairing we output is finalized now, 
and hence Inequality 1 holds for such t. 

For the t that are not sensitive, reset l(t) to be some other camera in the same bucket as the 
original l(t) and reset r(t) to be a camera that is the same or to the right of the original r(t). 
(During these resettings, we always ensure that each camera is l(t) or r(t) for exactly one t.) Now 
"move" the camera that is l(t) for each such t to the right endpoint of the bucket containing l(t). 

For a t that is not sensitive and for which l(t) does not lie in bucket Bq, it is clear that the new 
f3t is at least (1 — e) times the original fit- Now if l(t) lies in Bq, letting p\ and p2 denote the left 
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Figure 7: A figure to illustrate that angle(pitp2) is very small compared to angle(cite2 n ). 



and right endpoints of Bq, we argue that now 

fit > <*t ~ angle(pitp 2 ) > a t - angle (cifc 2 „) > a t - ^ X^ a *' 

For the second inequality, we show that angle(pitp2) < ;^angle(ciic2 n ); we omit from this version 
the somewhat tedious argument for this claim, but note that it is here that we use the fact that t 
is outside the circle with diameter c\C2 n - See Figure 7 for a figure to provide intuition for why the 
claim holds. 

Thus, for the t that are not sensitive, we have 

0t>a t - max{ea ( , 4j a t }. 

t 

We move on to the next stage with the targets t whose association with camera pairings has 
not been finalized (the targets that were not sensitive in this stage), along with the cameras l(t) 
and r(t) for such targets (the cameras that have not been paired up by our algorithm). 

Fixing 7T. This is quite symmetric to the previous step. Among the targets that have moved on 
to this step, let us call a target t sensitive if r(t) lies in bucket B[ for i > 1 (so not in the first 
bucket -Bq), and the angle ptr(t) is at least e times the angle fit = l(t)tr(t), where p is the left 
endpoint of bucket B\. Since the buckets come from a conforming partition, Lemma 2 implies that 
for a sensitive t the angle ctr(t) for any camera c to the left of M is at least (1 — e)f3t- Fix tt so that 
7r(i?-) is the number of sensitive targets t with r(t) within B[. With this choice of tt, recall that our 
algorithm now fixes n(B'i) camera pairs at this stage. Now for each sensitive t, reset r(t) so that 
it lies to the right of (or is the same as) the original r(t) and is one of the vr(-B-) rightmost points in 
its bucket B[, and reset l{t) to be the camera that is paired with the new r(t) in the partial camera 
pairing that is now fixed. It follows that for all sensitive t, the current (3t is at least (1 — e) times 
the f3t at the end of the previous step, and thus > at — 2max{eat, ^ Y2t ^ ne association 
of sensitive t with the camera pairing we output is finalized now, and hence Inequality 1 holds for 
such t. 

For the t that are not sensitive, reset r(t) to be some other camera in the same bucket as the 
original r(t) and reset l(t) to be a camera that is the same or to the left of the original l(t). Now 
"move" the camera that is r(£) for each such t to the left endpoint of the bucket containing r(t). 
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For a t that is not sensitive, we can argue along the same lines as in the previous step to conclude 
that 0t>OL t - 2max{eat, ^ £) t a t }. 

We move on to the next stage with the targets t whose association with camera pairings has 
not been finalized (the targets that were not sensitive in this stage), along with the cameras l(t) 
and r(t) for such targets (the cameras that have not been paired up by our algorithm). 

Fixing /i. For every pair < i < k and < i! < j, fix n(i,i') to be the number of surviving 
targets t with l(t) € B; t and r(t) £ B'-,. With this choice of fj,, recall that our algorithm outputs 
fj,(i,i') pairs of cameras at this stage with one endpoint in B{ and the other in B' v . Reassign the 
l(t) and r(t) values for the surviving targets (this reassignment should not cause an l(t) or r(t) 
value to move to a different bucket), if necessary, so that these pairs that are output by 

the algorithm are assigned to the targets that led to the definition of fj,(i,i'). The association of 
such targets with the camera pairing that we output is finalized at this stage. Notice that ftt does 
not change for any target in this step, whether it is finalized or not. This is because the cameras 
have already been moved to endpoints of the bucket they belong to. In particular, this means that 
Inequality 1 holds for the targets that do not survive this stage. 

We now move to the final stage with the targets that survive. Note that the l(t) for such targets 
belong to the bucket B = [M — a, M], and the r(t) >M + j I . 

Fixing A. In this step, call a surviving target t sensitive if angle (l(t)tM) > e-angle(Z(i)tr(i)) = e(3f 
Lemma 2 implies that for a sensitive t the angle l(t)tc for any camera c to the right of M + ^ is 
at least (1 — e)/3t- Fix A to be the number of such sensitive targets t. With this choice of A, recall 
that our algorithm fixes A camera pairs at this stage. Now for each sensitive t, reset l(t) so that 
it lies to the left of (or is the same as) the original l(t) and is one of the A leftmost points in its 
bucket B, and reset r(t) to be the camera that is paired with the new l(t) in the partial camera 
pairing that is fixed. It follows that for sensitive t, the new fit is at least (1 — e) times the fit before 
this step, and thus fit > a t — 3max{eaj, A Ylt a t\- The association of sensitive t with the camera 
pairing we output is finalized now, and hence Inequality 1 holds for such t. 

For the t that are not sensitive, reset l(t) to be some other camera in B, and reset r(t) to be a 
camera that is the same or to the right of the original r(t). Move the camera l{t) to the point M. 
The new @ t is at least (1 — e) times the fit before this step, and thus fit > ®t — 3max{eat, -A Ylt a *}- 

By reassigning l(t), if necessary, but not the r(t), the camera pairs that the algorithm outputs 
now will have the form (l(t),r(t)) for each surviving t. This completes the association for all the 
targets. This reassignment does not change fit because each l(t) is on point M, and Inequality 1 
holds for every t. 

This completes our argument about the approximation guarantee. 

Theorem 3. There is an algorithm for the MAXSUMOFANGLES problem that, for any < e < 1, 
runs in quasi-polynomial time and returns a solution in which the sum of tracking angles is at least 
(1 — e) times that in the optimal solution. 

4 Minimizing the Sum of Ratios 

We now present our algorithm for the MINSUMOFRATIOS problem. Recall that in this problem 
the cost of assigning a camera pair (cj,Cj) to target t^ is ^ where is the vertical distance 
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between tk and the line I containing the cameras, and bij is the baseline corresponding to (distance 
between) c, and Cj. We would like to minimize the sum of these costs. In this section, we present 
an algorithm that returns a solution whose cost (sum of aspect ratios) is no worse than (1 + e) 
times the same in the optimal solution, in quasi-polynomial time for any e > 0. 

The Algorithm. Our algorithm is recursive and is formally defined as Algorithm 2. The algo- 
rithm takes as input a set of 2m cameras X, a set of m targets Y, and two positive real numbers £ 
and u. The cameras are indexed x\,x<i,... X2 m such that x\ < X2 < ■ • • < X2 m and x m < M < x m+ \ 
and the targets are indexed yi, j/2> ■ ■ ■ Um such that Z\ < Z2 < • • • < Z m . The algorithm will return 
a camera pairing P of the cameras in X such that the baselines of all of the pairs in P are at 
least £ and are at most u. If no such pairing is possible, our algorithm returns a "dummy" camera 
pairing /; for ease of description, we define cost(P, T) = 00 for any pairing P that contains /. Our 
algorithm breaks up the line I into intervals called buckets. We will use \d\ to denote the number 
of cameras in X that lie in bucket d. We assume < e < 1/2. 

Algorithm 2 minRatioPair(X, Y, £, u) 
1: If I X I = 0, return the empty set. 

2: Let C be the set of all baselines bij such that 1 < i < m, m + 1 < j < 2m, and £ < bij < u. 
3: Set local variable best <— 00, and Pbest I- 
4: for each /3 £ C do 

5: Partition the interval [M — 2n(3,M] into O(^p) buckets in the following way. Initialize 
r = M. 

6: for i = —1, 0, 1, . . . , 2 log n do 

7: Discretize the interval [r — 2 l ^,r] into - buckets of equal length. 

8: r<-2*£ 

n 

9: Symmetrically partition the interval [M,M + 2n(3] into 0(^|^) buckets. 
10: Let B\ denote the set of all buckets to the left of M, and let B2 denote the set of buckets to 
the right of M. 

11: for each map [i : B\ x B2 — > Z + and each map a : B\ U B2 — > Z + such that <r(B) + 

T,B'eB 2 l J -( B ' B ') ^ \ B \ for each B e Bl > °"(- B + ^BeB! V( B > B ') < \ B '\ for each B' € B 2 , 
and EsgBi = Es'eB 2 CJ (- B/ ) do 

12: X s h ort <— 0, X m jrf <— 0, Xi on g <— 0, P m jrf <— 

13: Go through each (B, B') £ Bi x B2 in any order and pair the fJ>(B, B') cameras in B that 
are closest to M with the n{B, B') cameras in B' that are closest to M and place these 
pairs in P m id- Add these cameras to X m id and remove them from their respective buckets. 

14: For each bucket B € B\ U B2- Of the cameras that remain in B, place the cr{B) cameras 
that are farthest from M into X s /j 0rf . Place all other remaining cameras in B into Xi ong . 

15: Denoting m s = |Xs ^° rtl and m mid = let Y s/lort = y 2 , • • • , fe,}, Y mid = 

{ym s +i, ym s +2) • • • ■> 2/m s -i-m mid }) and Y\ on g = {y ms _|_ mmid _|_i , . . . , y m }. 
16: if |Kjhort| < m/2 and |l^ D ng| ^ m/2 then 

17: P <- P mi(i U minRatioPair(X s/lort , Y short ,£, (1 + e) ^) U minRatioPair(X io „ 9 , l^ ong , (1 - 

e)2n/3, u(l + e)) 

18: If cost(P, Y) < best, then best <— cost(P, Y) and Pbe^f <— P. 

19: Return Pbest- 
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Figure 8: Illustration for the discretization process. The value in each interval denotes the length 
of the interval. Each interval is divided into | buckets of equal length. 
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Figure 9: Illustration for the step in algorithm where the bucket pair (B,B') contributes to X m id- 
The filled in cameras will be paired up and placed into X m id for fJ,(B, B') = 2 



For solving the input instance of MINSUMOFRATIOS, we invoke minRatioPair(C, T, b n n+ \ , bi^n)- 
For this invocation, note that the depth of the recursion is at most logn, because the size of Y 
falls by a factor of 2 with each recursive call. It will also be useful to note that when we make a 
recursive call, the lower bound t for the recursive call is not smaller than the original t\ the upper 
bound u for the recursive call can be larger than the original, but only by a factor of (1 + e). 



Running Time. We will now show that Algorithm 2 runs in quasi-polynomial time when the 
instance minRatioPair(C, T, & n ,n+l> &i,2n) is invoked. The set C contains 0(m 2 ) elements. The 
number of buckets in B\ U £>2 is 0(^|^). This means that the number of choices of fi and a is at 

qi l°g^ n \ \C( log2 , n ) C ( log2 , - ) 

most [m + 1) y t' 1 ' . Thus we make (m + 1) v ? ; = n v ^ ' direct recursive calls. Since the 
depth of the recursions is at most logn, this gives us our quasi-polynomial running time. 



Approximation Ratio. We now give an argument for the approximation factor guaranteed 
which, though informal, highlights the main issues. Consider the optimal pairing Bopt for the 
input instance with cameras C and targets T, and the optimal association of targets in T with 
cameras in Bopt- We show we can associate each target t with a camera pair in such a way that 
(a) these camera pairs form a camera pairing of C, (b) the baseline of the camera pair associated 
with each t is at least (1 — 5e) times the corresponding baseline in Bopt, and (c) our algorithm 
outputs a solution that is at least as good as this association. Thus, we obtain our (1 + 0(e)) 
approximation factor. 

Let us describe how this special association is constructed. Let us start with the optimal 
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association of each t £ T with the corresponding camera pair in Popt, and consider the invocation 
minRatioPair(C, T, 6 n)n +i, &i,2n)- Consider the situation where the algorithm chooses (3 to be the 
median baseline in Popt- With this (3 it computes bucket sets B\ and Bi- The pairs in Popt can be 
split into three sets: those whose baselines are strictly smaller than (3/2n (the short baselines), those 
whose baselines are between /3/2n and 2n(3 (the medium baselines), and those whose baselines are 
strictly larger than 2n(3 (the long baselines). Consider the algorithm's choice of \x so that ii{B, B') 
equals the number of medium baselines with endpoints in B and B', and the choice of a so that 
(j{B) equals the number of cameras in B that are endpoints of short baselines. Notice that with 
this choice of \i the algorithm constructs sets Y short-, Y m ^ and Y[ ong of sizes m s = ^ g (j(fi), 
rrimid = Y.{b,b')^( B i B ')i and m- m s - m mid , respectively, and sets X short , X mid , and Xi ong of 
sizes 2m s , 2m m id, and 2{m — m s — rrimid), respectively. It also constructs a pairing P m id of the 
points in X mi d- We show that from Popt we can obtain also a pairing P s hort of X^^t and a pairing 
Piong of Xi on g, and modify our initial association of targets so that: 

1. Each target in Y m id is associated with a pair in P m id which is at most (1 — e) times shorter 
than the pair it was initially associated with. 

2. Each target in Y s h ort is associated with a pair in P s hort that is at least as long as the pair it 
was associated with. 

3. Each target in Yi ong is associated with a pair in Pi on g that is shorter than the original pair it 
was associated with by at most an additive e2n[3. 

(In addition, baselines in P s h or t have length at most (l+e)<fr, and this explains the upper bound 
in the recursive call minRatioPair(X s /j 0rt , F s /iort> i, (1 + e)^-). Similarly for the other recursive call.) 

At this point, we have finalized our special association for targets in Y m id- This is the association 
in (1) above; notice that we lose only a (1 — e) factor. For the targets in Y s h or t, we "recursively" 
construct the special association in this way starting with the new association with P s hort and 
following the recursive call minRatioPair(X s / lort , Y s h or t, (1 + Similarly for the targets in 

Yl on g. 

Notice that the baselines associated with targets in Y s h or t have not shrunk; the baselines asso- 
ciated with targets in Yi ong may have shrunk, but by at most an additive e2n(3. Now for a given 
target t, how much shrinkage can it experience as we recursively construct our special association? 
This is no more than the sum of the e2nf3 terms over all the /3's that it sees and for which it is 
in Yi ong . Now, Lemma 4 implies that this can be bounded by a geometric series that is at most 
4en/3 for the largest (3 it is in Yi ong for. And this in turn means that in our special association, t is 
associated with a baseline that is at least (1 — 5e) times the corresponding baseline in Popt- 

Lemma 4. Consider two recursive calls made by the algorithm where the second recursive call is 
nested (possibly by several levels) within the first. Let (3\ denote the choice of (3 in the first call 
within which the second call is contained, and let 02 denote any choice of (3 within the second call. 
Then either (3 2 > 2/3i or f3 2 < (3i/2. 

Proof. The immediate recursive calls that we make from the first call with j3\ either have an upper 
bound of (1 + e)^ or a lower bound of (1 — e)2nf3\. Now lower bounds do not decrease with 
recursion in our algorithm, so (3 2 > (1 — e)2n/?i > 2/3i (e < 1/2) if the second call is nested within 
an immediate call of the latter type. Upper bounds may increase with recursion but only by a factor 
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of (1 + e), and since the depth of recursion is less than logn, we have 02 < (1 + e) logn f^ < /?i/2 if 
the second call is nested within an immediate call of the former type. □ 

Theorem 5. There is an algorithm for MINSUMOFRATIOS that, for any parameter < e < 1, 
runs in quasi-polynomial time and returns a solution whose cost is at most (1 + e) times that of the 
optimal solution. 
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A Proof of Lemma 2 

Denote angle (txy) by a. For greater values of a, the rays ty and xy meet further apart and in 
the limiting case i.e. for a = ir — 8 these become parallel. Thus, we are interested in the range 
< a < 7r — 9. First we shall show that within this range as a increases, the ratio j^j- strictly 

increases. Then we shall show that in the limiting case - i.e. when a — ► ir — 6, the ratio t— r — > \- 

° ' \XZ\ €* 

This will complete the proof. 

For the first part, refer back to Figure 6. Here we consider two different values of a such that 
< a.\ < «2 < 7T — 6. Let yi, z\, z[ and y2, zi, z' 2 denote the corresponding points as defined before 
for these two cases. 
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Now consider the line through z[ that is parallel to xy2- Let this line intersect the line tz2 at 
point w. Now clearly Az±xz2 and Aziz[w are similar since two of their angles are equal. Hence, 
\zjw\ _ l - zi^il _ g u |. c x ear iy \z2z' 9 \ > \z\w\. Hence, we have f 2Z ^ > ^—4. This clearly implies 

| X 2^2 1 | Z\ X | | XZ% | | XZ\ | 

r^4 > 1^4. Thus as a increases our ratio increases. 

\xz 2 \ \xzi\ 

For the second part, refer to Figure 10. We consider the limiting case in which a = tt — 8. 

As we observed before in this case rays ty and xy become parallel. Let y' be the point on ty 
at a distance \xz'\ from t, so that we have the parallelogram txz'y'. In this scenario, Atxz is 
similar to Aty'z' (The Figure is not to scale). This is because angle(xtz) = ang\e(y'tz') = e • 8 
and angle(ixz) = angle(iyV) since these are opposite angles of parallelogram txz'y' . Now, due 
to similarity of triangles Atxz and Aty'z', -rrlr = -q—n- Hence, \ty'\ = MliE-L!.. But note that 

\y z \ \ty | l-E^I 

\ty'\ = \xz'\ and \y'z'\ = \tx\ since these are opposite sides of the parallelogram. 

\xz f I I tx\^ \tx I ^ 

After plugging in these values we have = = L^W- By applying the sine rule in 

Atxz, we have — - = \^\ — ^_ no t e that angle(xtz) = e ■ 8 and angle(tzx) = 

' smangle(i2x) smangle(xtz) & \ ; & v / 

angle(zty) = (1 — e) • 8 since these form a pair of alternate angles between parallel lines ty' and xz' 

cut by transversal tz. 

Hence we have (g)' = < < 4, 

Thus g < £ 
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